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SUMMARY 


Current econometric research on the determinants of health care spending is characterized by three serious 
shortcomings: a lack of attention to dynamics, a failure to deal with heterogeneity and a neglect of sensitivity 
testing. The latter shortcoming is a particular problem in an area where there is little theoretical guidance with 
which to judge the validity of estimates. This paper deals with these three problems by adopting recently developed 
techniques for analysing dynamic heterogeneous data fields containing non-stationary variables. This method 
allows for parameter heterogeneity across countries and is preferred to pooling with country specific effects, since 
the latter does not yield consistent estimates in this context, despite its common use in studies of health care 
spending. Long-run and short-run elasticity estimates are found to be sensitive to both model specification and 
sample composition. An attempt is made to compensate for the weak theoretical base by increasing the emphasis 
on statistical and econometric criteria for model evaluation, and by subjecting the results to rigorous sensitivity 


testing. Copyright © 1999 John Wiley & Sons, Ltd. 
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INTRODUCTION 


Joseph Newhouse’s claim, in 1977, that health 
spending was a luxury good, has prompted over 20 
years of empirical research into the determinants of 
health care expenditure [1]. During this time there 
has been little progress beyond the finding that 
variations in per capita national income are closely 
correlated with variations in per capita health 
spending. In particular, there is still considerable 
debate regarding the size of the income elasticity of 
health spending [2]. The failure to progress knowl- 
edge fuels criticism that this type of macro analysis 
is not helpful in answering the important questions 


about health care expenditure, and in particular in 
contributing to policy advice. 

The proliferation of econometric work on health 
care spending is characterized by three serious 
shortcomings. Firstly, a lack of attention to the 
dynamics of the relationship between health spend- 
ing, income and other potential regressors. Sec- 
ondly, a failure to adequately deal with the 
heterogeneity of relationships across countries. 
Thirdly, a neglect of diagnostic and sensitivity 
testing, and a failure to systematically consider the 
robustness of the results. The latter shortcoming is 
a particular problem in an area where there is 
little theoretical guidance against which to judge 
estimates. 
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This paper contributes to knowledge in this 
area by analysing the determinants of health care 
spending as a dynamic, heterogeneous data field. 
‘Data field’ is a term coined by Quah [3] to 
describe a panel data set consisting of a relatively 
small number of cross sections, each with a rela- 
tively long time series dimension. Like the health 
expenditure data, the observations are often mea- 
sured at country level. This can be contrasted with 
traditional panels measured at the level of the 
individual, which have large numbers of cross-sec- 
tional observations measured at very few points in 
time. It should also be noted that unlike many 
traditional panels (for example the British House- 
hold Panel Survey), data fields are not explicitly 
designed as a panel to analyse a specific set of 
issues. Each country collects its data indepen- 
dently of the others and there are questions of 
comparability [4]. 

Data fields provide extremely sophisticated 
data sets. The time series element enables exami- 
nation of dynamic relationships and the combina- 
tion of cross-section variation and long time series 
allow us to consider parameter heterogeneity over 
the cross-section groups. As well as being of value 
to anyone interested in the macroeconomics of 
health care spending, this paper also has a more 
general role in illustrating recent developments in 
the econometric analysis of data fields. Like many 
variables that form the time series in a data field, 
the variables here are non-stationary. These fea- 
tures warrant particular consideration, and up to 
now there has been a shortage of empirical appli- 
cations to illustrate the theoretical developments. 

The first two sections of this paper consider 
why the relationships between health spending, 
income and a number of other potential regres- 
sors are dynamic and heterogeneous. The third 
section considers the appropriate estimation tech- 
niques for dynamic, heterogeneous relationships 
involving non-stationary variables. Here we see 
that pooling the data, even after allowing for 
fixed effects, does not result in consistent parame- 
ter estimates in the dynamic case. 

The data, method and initial results are out- 
lined in the fourth section. In the fifth section 
these results are subject to sensitivity testing. This 
reveals that long-run elasticity estimates are sensi- 
tive to model specification, particularly to the 
inclusion of a time trend. It is difficult to obtain 
robust estimates of short-run effects, but these are 
important and appear to be consistent with a 
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long-run equilibrium relationship between health 
spending and income. A concluding section sum- 
marizes the findings and discusses the implica- 
tions for policy and future research. 


HETEROGENEITY IN HEALTH CARE 
SPENDING 


Early cross-section studies of health care spending 
have attracted a number of criticisms of which 
Parkin et al. [5] and Gerdtham [6] provide a good 
discussion. Of current relevance are those criti- 
cisms relating to the imposition of unrealistic 
assumptions regarding the homogeneity of tastes, 
preferences and production functions across coun- 
tries. Health systems differ across countries ac- 
cording to economic, social, political and cultural 
factors; as well as beliefs about medicine and 
health. In an econometric framework, the prob- 
lem centres on the non-observability of many of 
these effects, which are likely to be correlated with 
the observed regressors, and therefore lead to 
specification bias in the estimated coefficients. 

Methods of dealing with parameter heterogene- 

ity are still underdeveloped in the health care 
expenditure literature. Consider a general model 
of the form: 
Nin = B(B' Xi) + En i=1,2,...,N3 
$= 12 ed. (1) 
where h,, is total health expenditure per capita for 
country jin time ¢, and x is a vector of regressors. 
This vector always includes income per capita, but 
the other columns vary across the studies in ques- 
tion, including such things as the proportion of 
the population aged over 65, the proportion of 
public financing of health care and the relative 
price of health care services. While the models are 
essentially ad hoc, the choice of right hand side 
variables is influenced by the vast discursive liter- 
ature on the possible determinants of health care 
spending [7—9]. The log-linear functional form, g, 
is popular because the coefficient estimates can be 
interpreted as elasticities. Income and expenditure 
data for the different countries are generally ad- 
justed using purchasing power parities. 

If ¢ is a single point in time, model (1) repre- 
sents a cross section, and implicitly imposes the 
homogeneity assumptions. This is unrealistic and 
imposes invalid restrictions on the parameters of 
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model (1). Heterogeneity is present across coun- 
tries, despite the fact that almost all of the empir- 
ical work is limited to the OECD. Early attempts 
to account for heterogeneity involved the inclu- 
sion of dummy variables to reflect various institu- 
tional characteristics. For example, Leu [10] 
included dummy variables for the centralized na- 
tional health systems of the United Kingdom and 
New Zealand, and estimated that health expendi- 
ture in these systems was 20% lower than the 
other OECD countries, even after income and 
demographic composition were taken into 
account. 

This method of allowing for varying intercepts 
extends to the panel data framework that has 
recently been adopted in the analysis of health 
care spending. Rather than specifying dummy 
variables a priori in what can be a rather ad hoc 
fashion, the method allows for country specific 
intercepts. The errors from model (1) are defined 
as: 


Ey = Mi + U;,, (2) 


where yi; is a country specific fixed effect term, 
and u,, can be thought of as the usual stochastic 
error term [11]. 

Some pooled studies fail to test Equation (1) 
against this more general fixed effect model 
{12,13]. This casts doubt on their results, since a 
number of studies have shown that the pool re- 
striction is invalid, and that inclusion of fixed 
effects makes a significant difference to the esti- 
mated slope coefficients [6,14—16]. 

Only a very small number of studies have gone 
beyond country specific intercepts, and attempted 
to deal with the possibility of slope heterogeneity. 
Pfaff [17] divides the OECD countries into two 
groups according to the mode of finance (mainly 
taxed-based or mainly contributory financed sys- 
tems), and finds that the income elasticities of 
health care spending are generally higher in the 
tax-financed systems. O’Connell [15] uses country 
specific demographic variables and finds that the 
effect of the ageing population varies substantially 
across the OECD countries. Gerdtham [6] takes a 
systematic look at the effect of sample composi- 
tion by dropping each country in turn, and finds 
significant parameter variation, especially in rela- 
tion to the income elasticity of health care 
spending. 

Analysing individual country time series is an 
extreme method of accounting for parameter het- 
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erogeneity. Both Murillo et al. [18] and Saez and 
Murillo [19] rule out the use of pooled data 
because of the invalidity of the homogeneity as- 
sumptions. However, this type of analysis does 
not take full advantage of the information pro- 
vided by data fields such as the OECD health 
data set. The individual time series solution re- 
sults in small sample sizes and does not allow us 
to estimate the average affect of say income on 
health spending across the OECD. 


THE DYNAMICS OF HEALTH CARE 
SPENDING 


Both the cross section and early panel studies of 
health spending assumed that all effects of in- 
come, demographics, relative price, etc., occurred 
in the current period. However, there are a num- 
ber of reasons to expect lags in the effects of 
various variables. Consumption theory suggests 
that health expenditure will relate to both perma- 
nent and transitory components of income. The 
actual process of expenditure adjustment is not 
well understood and depends on many factors 
including organizational dynamics, technological 
change and expectations [20]. Public expenditure 
on health care is likely to be influenced by elec- 
toral cycles. Expenditure, particular on capital 
goods, is committed over long periods. Labour 
costs are a high proportion of total health spend- 
ing and it has been shown that institutional rigid- 
ities in the US health system, delay adjustment of 
health services employment to macroeconomic 
conditions and other shocks [21]. This tendency is 
likely to be even greater in other OECD countries 
with a higher proportion of health employment in 
the public sector. 

In the recent literature consideration of dynam- 
ics in health spending has been tied up with the 
issue of non-stationarity, and the possibility of 
cointegrating relationships. In summary the re- 
gression of non-stationary variables is likely to 
result in a spurious regression problem, by which 
the apparent close correlation between health care 
expenditure and, say, per capita income is the 
result of common trends rather than evidence of 
any real economic relationship. Non-stationarity 
can arise from deterministic and/or stochastic 
trends in the data, and if variables possess a 
stochastic trend component they should be dif- 
ferenced to impose stationarity and to avoid 
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misleading inference; these variables are said to be 
integrated of order one, or /(1). In the special case 
where a linear combination of non-stationary 
variables is itself stationary, then this combination 
represents the cointegrating, or long-run, relation- 
ship, which can be specified in levels, with short- 
run dynamics modelled via an error correction 
process [22]. 

There is substantial evidence that many of the 
series commonly used to investigate the determi- 
nants of health care spending are non-stationary 
[14,16,23]. A recent paper by McCoskey and 
Selden [24] provides contradictory evidence. How- 
ever, their findings are questioned by Hansen and 
King [25], and turn out to be very sensitive to the 
exclusion of a time trend from their testing equa- 
tions; an issue which is discussed in more detail in 
the results section below. The evidence on cointe- 
gration is less conclusive [14,16,18,19,23]. The 
contradictory evidence is hardly surprising given 
the relatively short time series employed, and the 
many methods of testing for cointegration. It is 
also questionable, how useful the tests are in the 
current context. Intuitively there must be a long- 
run relationship between health spending and 
income, with a long-run elasticity of at most 
unity. Studies that focus simply on detecting 
a cointegrating relationship are only concerned 
with long-run inference, and neglect short- 
run dynamics, which are likely to be more rele- 
vant to policy formulation. The dynamic structure 
of the models is simplified, and many of the 
studies have had to allow for serially correlated 
errors in order to cope with omitted dynamics 
[13,15,16]. 

A small number of papers have considered 
short-term effects by estimating error correction 
models (ECMs) [6,18,19]. Gerdtham [6] estimates 
a first order autoregressive distributed lag 
(ADL) model with fixed effects, for a 22 country 
OECD panel, 1970 to 1987. He tests a variety 
of restrictions of this general model and prefers 
an ECM with long-run restrictions on the age 
and finance variables and a zero restriction on 
price effects. This model imposes a_ unitary 
long-run income elasticity on health spending, 
and his estimates suggest that health spending 
adjusts towards its equilibrium value at a rate 
of approximately 22% per year, although this 
estimate is sensitive to sample composition, 
illustrating the importance of parameter hetero- 
geneity. 
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ANALYSIS OF DYNAMIC 
HETEROGENEOUS DATA FIELDS 


The panel econometrics literature tends to focus 
on intercept variation, with little attention paid to 
the possibility of variation in slope coefficients 
across the groups involved. This may be due to 
the fact that when the main estimators were devel- 
oped, most available panels only had a relatively 
short time dimension, hence, it was not possible 
to investigate slope heterogeneity. However, it has 
also been suggested that slope heterogeneity does 
not matter when we are interested in the average 
effect of an exogenous variable on the dependent 
variable [26], and further that long-run equi- 
librium responses are less likely to be subject to 
slope heterogeneity than short run effects. While 
the former is true in the static case with strictly 
exogenous regressors and coefficients varying ran- 
domly across groups, it does not extend to dy- 
namic models containing a lagged dependent 
variable. Substantial biases can occur in the esti- 
mate of the average long-run effect when a com- 
mon slope is incorrectly imposed on a 
heterogeneous model. 

Two recent papers discuss the appropriate esti- 
mation procedures for dynamic heterogeneous 
panels, such as the OECD health expenditure data 
field [27,28]. To summarize the main findings, if 
both WN and T are large there are four alternative 
procedures for estimating the average effect of 
some exogenous variable (say per capita GDP) on 
a dependent variable (per capita health spending). 
Firstly, the mean group estimator, in which sepa- 
rate time series regressions are estimated for each 
group and the coefficients are averaged across 
groups. Secondly, the pooled estimator in which 
data is pooled by imposing common slopes and 
allowing for country specific intercepts, as in 
Equation (2). Thirdly, aggregate time series re- 
gressions on the data averaged across countries. 
Finally, cross-section regressions with the data 
averaged over time. 

In the static case with strictly exogenous regres- 
sors and coefficients varying randomly across 
groups, all four procedures provide unbiased, 
consistent estimates of the average effects. How- 
ever, in the dynamic case only the mean group 
and cross-section estimators are consistent. This is 
true even if the underlying variables are J(1), as 
long as the individual group relationships are 
cointegrated. It should be noted that for the 
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cross-section estimator the data must be averaged 
over relatively long time periods. The early cross- 
section work on health expenditure, with data 
from just 1 or 2 years [1,5,10], does not yield 
consistent estimates. 

The aggregate time series estimator has not 
been used in health expenditure studies and will 
not be discussed further here. The pooled estima- 
tor has been commonly employed in recent work, 
but pooling fails to produce consistent estimates 
in the dynamic case, even if country specific ef- 
fects are allowed for. The results of Gerdtham’s 
fixed effects estimation for the ECM [6] should 
therefore be treated with caution, since this esti- 
mator tends to underestimate the average short- 
run effect and overestimate the average long-run 
effect [27]. A further note of caution is that under 
slope heterogeneity with /(1) data the pooled esti- 
mates will be inconsistent even if the relationship 
for each group cointegrates. The pooled estimator 
Imposes a common slope parameter, therefore 
introduces an /(1) component in the disturbances, 
thus the pooled model will not constitute a cointe- 
grating regression. This casts doubt on the results 
of those studies that have pooled non-stationary 
series [6,12,13,16]. 

In practice the mean group estimator is rarely 
used in macroeconometrics, and there are no ex- 
amples of its use in the analysis of health care 
spending. However, it is relatively straightforward 
to estimate and can be employed in a dynamic 
framework using a random coefficients model, 
such as that suggested by Pesaran and Smith [27] 
and Pesaran et al. [28]. Assuming, for simplicity, a 
first order heterogeneous dynamic model for 
health spending with only one explanatory vari- 
able, x, 


hig = lg; + Ay —1 + BoiXin + BriXie—1 + be 
$= 152. NG PSAs ct Te (3) 
All variables are measured in log form, and all 


coefficients vary across countries via a random 
coefficients model: 


Boi = Bot Nop etc., (4) 


where 7,;, >; etc., are assumed to have zero 
means and constant covariances. The long-run 
elasticity of h; with respect to x; is: 


9 but Bu 


: . 
1— «a, 


%;,=% + Mi 


(5) 
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Allowing these long-run elasticities to vary ran- 
domly gives, 0;=0+ 6, where €; is a stochastic 
term with a zero mean. The appropriate measure 
of the average long-run effect of x; on h; is given 
by the average of these long-run elasticities: 


» 9; (6) 


This estimator has the limit E(0;) = E(Bo;+ Bi;/ 
1 — a@,;), which is the same as the probability limit 
of the cross-section estimator. 

In many data fields, including that commonly 
used to analyse health spending, the variables in 
question are J(1). Pesaran and Smith [27] show 
that assuming a single cointegrating relationship 
between h,, and x,, for each country, then a static 
regression will provide superconsistent estimates 
of the long-run elasticities, which could then be 
averaged across countries. Assuming the x’s are 
strictly exogenous, the cross-section estimator is 
also consistent in the presence of /(1) variables, 
and the appropriate standard errors in this case 
are White’s adjusted [29]. 

One problem with deriving long-run elasticity 
estimates from an ADL model such as (3), is that 
it is not straightforward to calculate a standard 
error for the elasticity estimate since it involves 
the ratio of two estimates. In order to overcome 
this problem, Wickens and Breusch [30] suggest a 
reparameterization of the ADL model, based on 
the Bewlay transformation [31], to provide direct 
estimates of the 0,’s and the associated standard 
errors. The reparameterization involves subtract- 
ing «,,1,, from both sides, rearranging the x’s and 
renormalizing to give: 


Nig = Ag; + Ay; Ay — 1 + doiXie + 1; AX —1 + Cie 
(7) 


The transformed variables are not independent of 
€;, $O estimation is via instrumental variables 
(IV), and the appropriate instruments are the 
explanatory variables from the ADL model (3). It 
then follows that bo;=0,, i.e. that the estimated 
coefficient on x,, is a direct estimate of the long- 
run elasticity of h, with respect to x,, and the 
standard error is calculated in the normal manner. 

The alternative formulation of the dynamic 
model, (7), also has particularly useful properties 
when the variables in question are non-stationary 
and cointegrated. Estimates of the long-run effects 
will be consistent even if the underlying dynamic 
structure is misspecified by omitting difference 
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terms. This is equivalent to Engle and Grangers 
finding that a cointegrating relationship can be 
obtained from a model, which is misspecified due 
to the omission of short-run dynamics [22]. 

The reparameterization suggested by Wickens 
and Breusch separates long run and short-run 
dynamics and has advantages over the ECM. The 
ECM restricts the long-run elasticity, in (3), to 
unity, Le. o);+fo;+f,;;=1, and requires the 
short-run dynamics to be consistent with the long- 
run solution. Thus, we can rewrite (3): 


Ahj, = Bo; AX — A = %,)i—1 — Xn) + Er — (8) 


The error correction term (h;,_;—xX;,_,) 1s the 
short-run adjustment to equilibrium. More gener- 
ally we can remove the unitary long-run elasticity, 
and rewrite (8): 


Ahi, = Bo; AX — 1. =O) Mie — 1 — 9% — 1) + Fie 
(9) 


This is simply another expression for the ADL 
model, that does not impose any coefficient 
restrictions. 

Equation (9) is the specification employed by 
Murillo et al. [18], and also, presumably Saez and 
Murillo [19], although they do not state this. 
However, it is not a convenient form to estimate 
the long-run elasticity, 0, or the short-run dynam- 
ics. It is better to use the alternative formulation, 
(7). This model can then be subject to conven- 
tional tests of misspecification, which can confirm 
the statistical properties of the estimates of both 
the long run and short-run effects. 


DATA, METHOD AND RESULTS 


The dynamic heterogeneous relationship between 
health care spending and other potential regres- 
sors, is investigated using annual data for 20 
OECD countries, 1960 to 1993. All data is from 
the OECD Health Data CD-ROM [32], and all 
variables are in natural logarithms. In order to 
facilitate comparisons with existing studies the 
choice of regressors has been made on the basis of 
past literature and a detailed justification will not 
be repeated here (see Gerdtham ef al. [9] for a 
good discussion). 

his health care expenditure and y is GDP, both 
are per capita, measured in constant US dollar 
prices, and adjusted via purchasing power pari- 
ties. p is the percentage of publicly funded health 
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care spending; there is contrary theory and evi- 
dence in relation to both the size and direction of 
its effects on total health spending. a is the per- 
centage of the population aged over 65 years, 
included to account for the fact that ageing popu- 
lations generally use more health resources. r is 
the relative price of health care, i.e. the ratio of 
the health price index to the GDP deflator. This is 
included to separate price and income effects and 
see if richer countries actually purchase more 
resources or simply pay higher factor prices. 

The time trend, ¢, is a simple proxy for techni- 
cal change in the absence of any other usable 
measure. Technical progress may reduce the cost 
of producing a given output, however, it also 
increases the effectiveness of medical interventions 
producing both demand and supply side reasons 
for increasing expenditure. In reality the time 
trend will account for any non-observed trended 
effects, and its coefficient is unlikely to be robust 
due to collinearity with other trended variables. 

The models of health spending adopted in the 
literature are ad hoc, and this has implications for 
the econometric analysis. A weak theoretical base 
gives little guidance as to the possible explanatory 
variables and the causal mechanisms involved. 
Traditionally econometric results are judged 
against three sets of criteria. Theoretical criteria 
involve the comparison of results with a priori 
expectations. Statistical criteria involve tests for 
individual and joint significance of parameter esti- 
mates. Econometric criteria involve tests of the 
assumptions underlying estimation, and enable 
evaluation of the properties of the estimators. A 
lack of theoretical guidance necessarily increases 
the emphasis on statistical and econometric crite- 
ria, and ideally results should be subject to strin- 
gent sensitivity analysis in order to establish 
robustness in the absence of theoretical expecta- 
tions. Unfortunately this has not been the case 
with much of the existing literature. 

In preparation for structural modelling all the 
variables have been tested for unit roots using 
augmented Dickey—Fuller (ADF) tests [33], 
Phillips—Perron tests [34] and the ‘t-bar test’ [35], 
which exploits the panel structure of the data. All 
results are sensitive to the inclusion of a time 
trend in the testing equations. When a time trend 
is included, the results are robust to the number 
of lags and to demeaning (which minimizes the 
effect of structural breaks and inappropriately 
specified trends), and provide virtually no evidence 
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to reject the null hypothesis of a unit root, for 
any variable for any country. Without a time 
trend the unit root null is rejected for / and p in 
almost half of the countries (and for the t-bar 
test), although not for the demeaned series. The 
presence of a time trend in the underlying series 
was investigated using the sequential testing pro- 
cedure suggested by Holden and Perman [36]. 
This confirmed that most series were character- 
ized by both a unit root and a linear trend. 
Overall the above evidence seems to suggest that 
for all countries the variables are J(1), and these 
results confirm those of previous studies [16,23]. 
However, in the analysis that follows particular 
attention will be paid to the sensitivity of the 
results to the time trend. 

A first order ADL model with random coeffi- 
cients forms the basis of analysis: 


Nig = Og; + Uy — 1 + ait + Bot BiVi—1 + SP 
+O Pir + YoiMie + MiG —1 + Aol ie + Ali —1 


ye PAD NS PSA ce 
(10) 


All coefficients vary randomly across countries, i, 
as defined in (4). 

Table 1 reports the long-run elasticity estimates 
from a variety of estimators. Those in the first 
column are obtained from the Wickens and 
Breusch reparameterization of (10) estimated via 
IV. They are the means of the country specific 
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long-run estimates defined by (6). These estimates 
are superconsistent when the variables in question 
are I(1). The income elasticity estimate is larger 
than usually reported in the literature. The other 
regressors do not have significant long-run effects. 

Column (2) of Table 1 shows the long-run 
estimates derived from a static model. The income 
elasticity is much lower than that obtained from 
the dynamic model but is still significantly greater 
than unity at f)9,. In addition p and r both have 
significant long-run effects. However, the stan- 
dard errors may be misleading because they are 
based on inappropriate estimates of the variance. 
The properties of these estimates depend on 
whether this model actually represents a cointe- 
grating relationship, and evidence is inconclusive. 
Using the Engle—Granger procedure [22], the null 
of non-stationary residuals, i.e. no cointegration, 
is rejected in only four cases. Using the t-bar test 
[35] to exploit the panel structure of the data, 
gives a statistic of — 3.712, which rejects the unit 
root null at the 1% level. The multivariate Jo- 
hansen procedure [37] finds at least one cointe- 
grating vector for all but one country. The 
Pesaran et al. test, which is valid when the order 
of integration of the underlying regressors is un- 
known, only rejects the null of no long-run rela- 
tionship in five cases [38]. 

Estimates for the pooled model with fixed ef- 
fects are given in the second section of Table 1. 
The estimates in (3) are derived from a dynamic 


Table 1. Long-run elasticity estimates from alternative estimators 


Mean group Pooled Cross section 
(1) Qe (3) (4) (5)° 
y 2.041* 1.254* 1.875 1.515* 1.514* 
(0.428) (0.079) (0.056) (0.161) 
Pp 0.704 0.759* 0.523 0.417* —0.128 
(0.405) (0.089) (0.052) (0.214) 
a —0.895 —0.397 —0.830 —0.019 0.060 
(0.698) (0.224) (0.080) (0.220) 
r 0.215 0.332* 0.466 0.329* —0.427 
(0.249) (0.076) (0.045) (0.399) 


All regressions for i = 20, t = 34 (1960-1993). (1) and (3) ADL models, (2) and (4) static models. 


* Statistically significant at fo 95. 


@ Standard errors for the mean group estimator are computed assuming that regression coeffi- 
cients are independently distributed across countries. 
> Standard errors for the cointegrating regressions are based on inappropriate estimates of the 


variance. 
© White’s adjusted standard errors. 


Copyright © 1999 John Wiley & Sons, Ltd. 


Health Econ. 8: 459-472 (1999) 


466 


J. ROBERTS 


Table 2. Distribution of estimates underlying the mean group estimates 


y Pp a r t 
Mean 2.041* 0.704 —0.895 0.215 —0.024 
(standard error) (0.428) (0.405) —-(0.698) (0.215) (0.014) 
Median 1.380 0.516 —0.583 0.277 —0.014 
Standard deviation 2.180 2.134 2.877 0.954 0.075 
Minimum —2.210 —3.515 —6.025 — 1.395 —0.194 
Maximum 7.190 6.109 2.826 2.152 0.133 


* Statistically significant at fo 95. 
Estimates correspond to Table 1, column (1). 


model, and those in (4) are from the static 
model. These estimates are not consistent but 
are provided for comparison, since they are 
commonly employed in the health expenditure 
literature. These results are similar to those 
derived from the mean group estimators. How- 
ever, the static model will not constitute a 
cointegrating relationship even if individual 
country relationships cointegrate, due to the im- 
position of a common cointegrating vector 
across countries. 

The last column, (5), gives the results from a 
cross-section regression using averages for 1960 
to 1993. This model provides consistent esti- 
mates of the long-run effects. However, com- 
pared with the estimates in columns (1) and (2), 
apart from the income elasticity, the cross sec- 
tion does not provide good point estimates of 
the mean of the long-run coefficients. This may 
be due to interaction effects between the groups. 
The long-run income elasticity is significantly 
greater than one, but for all the other coeffi- 
cients the signs are reversed, although none of 
these estimates are statistically significant. 

Overall the results are fairly similar across the 
estimators. All of the estimates of the long-run 
income elasticity of health spending are signifi- 
cantly greater than one, and the signs of the 
other long-run effects are quite robust to model 
specification, except in the cross-section case. Of 
the consistent estimators, the mean group esti- 
mator is preferred to the cross section because 
heterogeneity is explicit. All of the random coef- 
ficients models were tested against the null hy- 
pothesis of no slope variation and this was 
rejected in every case. Despite the contradictory 
evidence on cointegration, there is little to 
choose between the results from the static and 
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dynamic models. This robustness across specifi- 
cation may lead us to have more confidence in 
the results. 

It appears that the long-run income elasticity 
of health spending is greater than one. The posi- 
tive coefficient on p, suggests that a 10% in- 
crease in the share of public spending, increases 
total health spending by about 7%; a result sim- 
ilar to that found in Leu’s early cross-section 
study [10]. The relative price effect is positive 
suggesting that total health demand is not con- 
strained by price increases, as would be the case 
in a fixed budget regime. However, this coeffi- 
cient is not statistically significant in all of the 
models. The effect of the ageing population is 
not significant, and the negative sign is contrary 
to expectations. 

The individual country regressions were tested 
for heteroscedasticity using White’s test [29], au- 
tocorrelation using the Breusch—Godfrey test 
[39], and normality of the residuals using the 
Jarque—Bera test [40]. Stability of the estimates 
was also tested using the Chow forecast test 
with the sample split in 1975 [41]. Overall the 
underlying regressions appear to be well speci- 
fied, and this confirms the statistical properties 
of the long-run estimates. The few specification 
problems that do exist are associated with four 
countries for which the first order ADL model 
is rejected as a restriction on a more general 
model. For Canada and France the problems 
are solved with a second order ADL model, and 
for Norway by including a second lagged depen- 
dent variable. As predicted by the theory, these 
changes make little difference to the estimated 
long-run elasticities for these countries. It 
proved impossible to find a model for Belgium 
that satisfied all of the diagnostic tests. 
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SENSITIVITY ANALYSIS 


The heterogeneity of the relationships underlying 
the mean group estimates is shown in Table 2, 
which reports the distribution of estimates across 
countries. All distributions have large standard 
deviations, and span the negative/positive range. 
The distribution of the long-run income elasticity 
estimate is highly skewed, and this is caused by 
five particularly high estimates for Belgium, Ger- 
many, Ireland, Italy and the Netherlands. In con- 
trast to the findings of Saez and Murillo [19] there 
does not appear to be any relationship between 
the estimated income elasticity and the level of 
income in a country (correlation coefficient of 
— 0.27). Despite the heterogeneity, the average 
long-run estimates are robust to the sample of 
countries used in the averaging process. This is 
illustrated in Figure l(a), which shows the effect 
of excluding each country in turn on the average 
estimate of the elasticities. 


Long Run Elasticity 
oS 
ao 
! 
! 
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Parameter stability through time is investigated 
by estimating the cross section recursively. Plots 
of these recursive estimates, shown in Figure 2, 
reveal systematic variation over the time period. 
The income elasticity estimate seems to fall 
throughout the 1970s and has been rising since 
the late 1980s. Interestingly, there does not appear 
to be a significant effect from the severe recession 
of the mid-1970s. The most striking result is that 
for the long-run relative price effect (7) which has 
been falling throughout the period from 0.245 to 
— 0.427. 

The results are extremely sensitive to the exclu- 
sion of a time trend, as shown in Table 3. The 
first column (la) shows the dynamic model esti- 
mated without a time trend. None of the estimates 
are very precise and the reason for this is illus- 
trated by Figure 1(b), which shows the sensitivity 
of this model to the cross section of countries 
used in the averaging process. All the estimated 
average long-run effects are extremely sensitive to 
the exclusion of Italy. The second column of 


country excluded 
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Figure 1. Dynamic model: (a) cross-section sensitivity with 7; (b) cross-section sensitivity with no T 
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Figure 2. Recursive estimates of long-run coefficients 


Table 3, (1b), shows the results for the same 
model when Italy is excluded, and these results 
are much more in line with those reported in 
Table 1, column (1). 

The question of whether or not to include a 
time trend in the dynamic model is obviously 
crucial. The trend term is only statistically signifi- 
cant (fo9s) in five of the 20 individual country 
regressions. However, as Table 2 shows, the mean 
estimate is significant at fo ;, and it suggests that 
once differences in income, public spending, age 
and relative prices are taken into account, the 
average tendency is for health spending to fall by 
2.4% per year. The insignificance of ¢ in the 
underlying regressions, combined with disruption 
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to the other coefficient estimates when it is ex- 
cluded, is suggestive of collinearity between the 
trend term and the other variables measured in 
levels, since these also contain deterministic, as 
well as stochastic, trends. Pairwise correlation co- 
efficients confirm this. 

The problem with excluding ¢ is that the results 
become very sensitive to sample composition and 
model specification. In contrast when ¢ is in- 
cluded, the results are more robust, as shown in 
the second section of Table 3. Comparing these 
results to the Table 1, column (1), it seems that 
only the omission of p has any major effect. Wald 
tests confirm that both y and p should be included 
in nearly all of the underlying regressions. The 
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problem with simplifying the equations by exclud- 
ing a and r is that some of the models become 
very unstable, and as a result the average esti- 
mates become very sensitive to the cross section of 
countries used. It is therefore safer to include all 
of the variables, including ¢, due to their theoreti- 
cal importance, the cross-sectional stability of the 
estimates and the statistical properties of the 
models. 

Finally, we should consider the short-run dy- 
namics implied by the first order ADL, and also 
consider dynamic simplifications. Short-run dy- 
namics are given by the coefficients on the differ- 
enced variables in the reformulated version of 
(10). The coefficients on Ay, Ap, Aa and Ar, 
respectively, are 0.017, — 0.459, 1.581 and 1.036. 
Only the latter is statistically significant at fo 10, 
and unlike the long-run estimates, the short run 
effects are very sensitive to the cross section of 
countries used in the averaging process. Neverthe- 
less they do make sense intuitively, suggesting that 
the short-run effect of changes in income, public 
spending and age composition are small, or non- 
existent, compared with the long-run effect. How- 
ever, the short-run effect of relative prices is more 
Important than the long-run effect and suggests 
that in the short-run price increases are met by a 
one-for-one increase in health expenditure. 

Simplifications to the dynamic structure are 
tested, via Wald tests, as restrictions on Equation 
(10). The restrictions and results are summarized 
in Table 4. The static model, that has character- 
ized much of the literature, is rejected for all 
countries. The strictest ECM(1) that forces all 
coefficients to sum to 1, is rejected in only five of 
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the 20 cases. The less restrictive ECM(2) that 
restricts only the income elasticity (to 1), appears 
satisfactory for all countries. Although, there is 
now some conflict with the unrestricted model 
which predicts an income elasticity greater than 
one. Gerdtham’s [6] model, ECM(3), is rejected in 
nearly all cases. 

Estimates from the chosen ECM give an aver- 
age adjustment coefficient of — 0.221, which is 
statistically significant at f)9,. This suggests that 
countries take an average of about 5 years to 
adjust to long-run equilibrium, and this adjust- 
ment is slower than predicted by Saez and Murillo 
[19] for the EC countries, but similar to the 
findings of Gerdtham [6] for the OECD. The 
error correction term is sensitive to the cross 
section of countries used, but it is has the correct 
negative sign for all countries except Italy, and is 
statistically significant in 12 cases. This adds sup- 
port to the idea of an equilibrium relationship 
between y and A, despite contradictory results 
from the cointegration tests. 


CONCLUSION 


This paper has considered the difficulties of esti- 
mating elasticities of health care spending in the 
light of four factors. Firstly, the relationships 
involved are heterogeneous across countries. Sec- 
ondly, the relationships are characterized by dy- 
namic effects. Thirdly, the variables involved are 
non-stationary. Fourthly, theory gives very little 
guidance against which to judge the estimates. 


Table 3. Sensitivity analysis of mean group estimates 


Time trend excluded 


Exclusion of explanatory variables 


(1a) (1b) (2) (3) (4) 
y 0.948 1.263* 3.180% 2.103* 2.103* 
(1.877) (0.348) (0.882) (0.870) (0.383) 
p - 0.226 1.233 0.735 0.749* 
(6.715) (0.911) (0.488) (0.367) 
a 0.660 —0.720 =)288 — 1.120 
(4.217) (1.942) (1.484) (0.594) 
r —0.514 0.487 —0.117 0.315 
(2.542) (0.389) (0.571) (0.369) 


See notes to Table 1. 


(la) ADL model, (1b) as (1a) but excluding Italy. (2)-(4) include a time trend. 
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Table 4. Restriction tests on ADL mean group estimator 


Model Restriction No of rejections 
(from 20) 
Static 0 =%=f,=0,=)7,=4,=0 20 
Partial adjustment at, =f, =0,=7,=4,=0 10 
Growth rate tr =0, 4 = 1, Bo = —fy, 69 = — 9), Yo = —V1,, A0= 11 17 
ECM(1) oy =0, + Bot+Pi toot; +yotV t4t+4, = 1 5 
ECM(2) tr =0, 4+ fot+f,=1 0 
ECM(3)—Gerdtham [6] at, =0, 4+ Bot fh, =1, 69 = —O), Yo = —Y, 29 =4, =0 15 


These represent tests of restrictions on the ADL model (10) estimated for each of 20 OECD countries: h,, = %;+.0);4;,—-1+ 


2 ‘ . , 
ort + Bo Vit BiVin—1+ SoPie t+ O1Pi—1 + Yodiet Min —1 + Aoi t Alin 1 + bie 


Using techniques designed to cope with the first 
three of these factors, the results suggest that 
long-run elasticity estimates are sensitive to model 
specification, and in particular to the exclusion of 
a time trend. With a trend term, the long-run 
income elasticity of health spending is greater 
than one, but without it is approximately unity. 
However, the omission of the trend results in 
unstable parameter estimates and considerable 
sensitivity to the cross-section composition of the 
sample. It is unclear theoretically, whether a trend 
should be included in the estimating equations, 
and this theoretical weakness (the fourth factor) 
creates by far the biggest problems for estimation 
and evaluation. 

Attempts have been made here to compensate 
for the weak theoretical framework by increasing 
the emphasis on statistical and econometric crite- 
ria to judge the validity of the results. However, 
the sensitivity of the results to model specification 
and sample composition only serves to increase 
the need for a stronger theoretical base. Despite 
these theoretical problems, some results warrant 
further mention. Once again income has emerged 
as the primary correlate of health spending, and 
the tests of short-run dynamics seem to suggest 
that the long-run income elasticity is approxi- 
mately unity. Public spending and age composi- 
tion do not seem to have any short-run effects on 
health spending, but the proportion of public 
spending does have a significant positive long-run 
effect in the cointegrating model. Short-run 
changes in relative prices are important and result 
in a one-for-one increase in health spending. The 
relative price and public spending estimates both 
suggest that there is a role for policy in con- 
trolling health spending. 

The primary aim of future work must be in 
strengthening the theoretical base from which to 
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analyse health spending. This could build on the 
work of Dunne and Smith [42], Dunne et al. [43] 
and Selvanathan and Selvanathan [44], who 
analyse health spending within a macroeconomic 
framework of expenditure or consumption 
behaviour. 

Empirically the analysis could be improved by 
extending the data set to lower income countries, 
and attempts have already been made in this area 
[45,46]. In addition, while this paper has assumed 
that the regression coefficients are independently 
distributed across countries, it would also be use- 
ful to consider estimation techniques that com- 
bine parameter heterogeneity with cross country 
correlations and interaction effects. Finally, Bal- 
tagi and Griffin [47] have recently questioned the 
forecasting performance of the mean group esti- 
mator, particular in relation to long range fore- 
casts, and suggest that the efficiency gains from 
pooling could more than offset the biases due to 
heterogeneity. This issue should be investigated 
empirically within the context of health care 
expenditure. 
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